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Abstract:  Palantir  is  a  configuration  management  workspace  awareness  tool  that  continuously  informs  developers 

of  the  changes  that  are  made  in  parallel  by  other  developers  in  other  workspaces.  In  order  to  achieve  its 
goal  of  reducing  the  number  of  merge  conflicts  when  developers  commit  their  artifacts,  Palantir 
deliberately  breaks  traditional  workspace  isolation  in  order  to  promote  better  coordination  of  parallel 
activities.  In  this  paper  we  examine  four  different  visualizations  that  developers  can  use  for  visualizing 
the  activities  in  other  workspaces.  We  discuss  their  strengths  and  weaknesses,  role  within  Palantir,  and 
opportunities  for  future  improvements. 
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Abstract:  Palantir  is  a  configuration  management  workspace  awareness  tool  that  con¬ 

tinuously  informs  developers  of  the  changes  that  are  made  in  parallel  by  other 
developers  in  other  workspaces.  In  order  to  achieve  its  goal  of  reducing  the 
number  of  merge  conflicts  when  developers  commit  their  artifacts,  Palantir  de¬ 
liberately  breaks  traditional  workspace  isolation  in  order  to  promote  better  co¬ 
ordination  of  parallel  activities.  In  this  paper  we  examine  four  different  visu¬ 
alizations  that  developers  can  use  for  visualizing  the  activities  in  other  work¬ 
spaces.  We  discuss  their  strengths  and  weaknesses,  role  within  Palantir,  and 
opportunities  for  future  improvements. 

Key  words:  Configuration  management,  workspace  awareness,  workspace  visualization, 
coordination 


1.  INTRODUCTION 

At  its  core,  a  configuration  management  system  is  geared  towards  help¬ 
ing  developers  coordinate  their  activities.  Most  configuration  management 
systems  do  so  by  separating  the  overall  development  effort  over  multiple, 
isolated  workspaces,  such  that  one  developer’s  changes  do  not  affect  those 
of  another  developer.  This  kind  of  isolation  works  fine  when  work  is  divided 


1 


2 


Anita  Sarma  and  Andre  van  der  Hoek 


among  developers  in  a  mutually  exclusive  manner.  In  that  (ideal)  case,  there 
will  be  no  conflict  when  a  developer  places  their  changes  back  in  the  central 
repository.  Unfortunately,  the  ideal  case  can  normally  not  be  achieved  and 
consequently  changes  made  by  different  developers  in  different  workspaces 
regularly  conflict  with  each  other  [7,8,13].  These  conflicting  changes  lead  to 
integration  problems  that  often  must  be  manually  resolved  [10]. 

Palantir  is  a  configuration  management  workspace  awareness  tool  that  is 
based  on  the  hypothesis  that  being  aware  of  each  other’s  workspace  activi¬ 
ties  enables  developers  to  better  coordinate  their  parallel  changes  and  lessen 
the  number  of  conflicts  that  will  occur.  In  effect,  Palantir  is  based  on  the 
premise  of  early  detection:  rather  than  discovering  that  two  changes  are  in 
conflict  at  the  moment  of  committing  the  second  change,  Palantir  helps  de¬ 
velopers  uncover  potential  conflicts  as  they  are  making  their  changes  in  their 
respective  workspaces.  Palantir  shares  this  goal  with  several  other  research 
approaches  [1,4,5,9,11,12],  but  distinguishes  itself  by  (a)  being  a  generic 
infrastructure  that  can  plug  into  any  configuration  management  system  and 
(b)  providing  a  rich  set  of  information  that  can  be  displayed  in  a  variety  of 
different  ways.  To  demonstrate  the  first  point,  Palantir  has  been  integrated 
with  RCS  [15],  CVS  [2],  and  Subversion  [16].  The  second  point  is  high¬ 
lighted  by  the  fact  that  Palantir’ s  different  visualizations  not  only  inform  a 
developer  of  which  other  developers  change  which  other  artifacts,  but  do  so 
in  a  pair-wise  fashion  while  presenting  a  measure  of  severity  of  the  changes. 

In  our  previous  work  [14],  we  described  Palantir  in  terms  of  its  goals,  ar¬ 
chitecture,  and  implementation,  and  discussed  our  experience  in  integrating 
Palantir  with  two  different  configuration  management  systems.  Furthermore, 
we  showed  how  Palantir  addresses  such  concerns  as  scalability,  flexibility, 
and  configurability.  One  of  the  observations  resulting  from  the  work,  though, 
is  that  different  users  prefer  different  kinds  of  visualizations  that  present 
them  with  different  kinds  and  amounts  of  information.  While  Palantir’ s  ar¬ 
chitecture  supports  the  incorporation  of  many  different  kinds  of  visualiza¬ 
tions,  we  had  only  prototyped  two  such  visualizations.  In  this  paper,  we  de¬ 
scribe  our  ongoing  work  in  further  refining  those  two  existing  visualizations 
as  well  as  two  new  visualizations  that  we  have  developed  in  response  to  a 
few  brief  user  interviews. 

The  remainder  of  the  paper  is  organized  as  follows.  In  Section  2,  we 
briefly  reiterate  the  overall  architecture  and  approach  of  Palantir.  Section  3 
introduces  our  four  visualizations  and  discusses  their  strengths  and  weak¬ 
nesses.  We  conclude  in  Section  4  with  an  outlook  at  our  future  work. 
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2.  APPROACH 

Palantir  itself  is  not  a  configuration  management  system.  Rather,  it  com¬ 
plements  and  does  not  interfere  with  existing  configuration  management  sys¬ 
tems  by  only  focusing  on  collecting,  distributing,  organizing,  and  presenting 
relevant  workspace  information. 

Figure  0  presents  the  architecture  of  Palantir.  The  bottom  six  grey  com¬ 
ponents  represent  components  traditionally  found  in  configuration  manage¬ 
ment  systems;  they  are  used  unchanged.  The  middle  grey  component  repre¬ 
sents  the  Siena  event  notification  service  [3],  which  Palantir  uses  to  broad¬ 
cast  and  filter  events  pertaining  to  activities  in  other  workspaces.  The  white 
components  are  Palantir  components  that  incrementally  implement  its  func¬ 
tionality.  Arrows  represent  information  flow.  We  briefly  discuss  the  role  and 
functionality  of  each  of  the  Palantir  components  in  the  following. 

•  A  Workspace  wrapper  intercepts  relevant  workspace  activities  and 
emits  events  regarding  their  occurrences  (e.g.,  artifact  is  placed  in 
workspace,  artifact  has  undergone  changes,  artifact  is  placed  back  in 
repository).  Workspaces  and  their  access  mechanisms  differ  per  con¬ 
figuration  management  system.  Therefore,  each  wrapper  is  built  for 
one  specific  configuration  management  system  and  translates  its  par¬ 
ticular  workspace  conventions  to  standard  Palantir  events. 

•  An  internal  state  component  collects,  preprocesses,  and  stores  all  the 
relevant  events  from  both  the  local  workspace  and  the  remote  work¬ 
spaces,  thereby  creating  an  organized  overview  of  workspace  activi- 


Figure  1 .  Palantir  Architecture. 
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ties.  In  order  not  to  maintain  a  view  of  all  workspaces,  the  compo¬ 
nents  leverages  information  regarding  artifacts  in  the  local  work¬ 
space  to  only  subscribe  to  Siena  events  pertaining  to  workspaces  that 
also  operate  on  those  artifacts.  This  significantly  reduces  the  number 
of  events  that  are  received,  thereby  reducing  the  eventual  cognitive 
burden  on  the  user. 

•  An  extractor  allows  a  developer  to  select  a  subset  of  events  that  they 
may  want  to  view.  Often,  human  knowledge  can  help  in  deciding 
what  may  and  may  not  be  relevant  events  (e.g.,  it  may  have  already 
been  agreed  upon  that  developer  Joe  will  not  make  any  changes  to 
certain  artifacts).  Palantir  supports  such  human  input  by  providing 
an  extensive  selection  mechanism  through  which  a  user  may  exer¬ 
cise  their  preferences  as  to  in  which  events  they  are  interested  (e.g., 
based  on  type  of  events,  author,  time  period,  and/or  severity). 

•  A  visualization  is  responsible  for  organizing  and  displaying  the  ac¬ 
tivities  as  they  occur  in  parallel  in  different  workspaces.  Only  events 
that  pass  the  extractor  are  visualized.  We  have  developed  four  visu¬ 
alizations  thus  far  -  a  ticker  tape  visualization,  a  tabular  visualiza¬ 
tion,  an  explorer  visualization,  and  a  fully  graphical  visualization. 

Using  the  above  components,  Palantir  continuously  shares  information  with 
developers  regarding  the  activities  of  other  developers.  Rather  than  develop¬ 
ers  having  to  proactively  obtain  limited  information  directly  from  the  reposi¬ 
tory,  Palantir  automatically  provides  them  with  workspace  awareness  in  the 
form  of  an  accurate,  complete,  and  always  up-to-date  picture  of  the  ongoing 
activities  in  other  workspaces. 


3.  VISUALIZATIONS 

The  architecture  of  Palantir  is  purposely  designed  to  support  multiple, 
visualizations.  Different  users  typically  have  different  desires  and  ways  of 
working,  which  should  be  accommodated  by  giving  them  the  option  to  oper¬ 
ate  with  a  view  that  supports  their  working  style.  Thus  far,  we  have  devel¬ 
oped  four  different  visualizations:  a  ticker  tape  visualization,  a  tabular  visu¬ 
alization,  an  explorer  visualization,  and  a  fully  graphical  visualization.  We 
discuss  each  of  these  visualizations  below. 

3.1  Ticker  tape  visualization 


The  first  visualization  is  a  simple  scrolling  marquee  that  is  similar  to  the 
one  provided  by  Elvin  [6].  Shown  in  Figure  0,  the  ticker  tape  scrolls  one-by- 
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Figure  2.  Ticker  Tape  Visualization. 

one  through  the  set  of  events  as  they  occur  in  each  of  the  workspaces.  Events 
can  be  sorted  as  desired,  per  author,  event  type,  or  severity.  Clearly,  if  too 
many  events  scroll  by  the  ticker  tape  loses  its  effectiveness.  Therefore,  the 
ticker  tape  best  serves  in  the  role  of  an  alert  mechanism.  Using  the  extractor 
component,  a  developer  should  set  a  relatively  high  threshold  for  the  severity 
of  the  events  they  want  to  see.  As  a  result,  the  scrolling  marquee  will  gener¬ 
ally  be  empty  but  when  it  is  not,  a  developer  is  informed  of  a  likely  serious 
situation  that  warrants  further  investigation.  Any  of  the  other  visualizations 
can  then  be  used  to  understand  the  details  of  that  situation. 


3.2  Tabular  visualization 


Shown  in  Figure  0  is  the  second  visualization,  which  presents  informa¬ 
tion  in  tabular  form.  In  this  visualization,  the  artifacts  in  the  local  workspace 
are  shown  on  the  left  hand  side  as  organized  in  an  expandable  tree.  A  devel¬ 
oper  can  selectively  open  those  artifacts  in  which  they  are  interested  and 
view  a  detailed  summary  of  all  relevant  activities  in  all  workspaces.  For  in¬ 
stance,  the  artifact  “spell”  is  currently  present  in  three  workspaces,  and  has 
undergone  some  changes  in  two  of  those  workspaces.  Examining  a  cell  in 


Figure  3.  Tabular  Visualization. 


6 


Anita  Sarma  and  Andre  van  der  Hoek 


the  table  gives  detailed  information.  For  instance,  “spell”  is  currently  in  the 
workspaces  of  Mike,  Pete,  and  Allen.  Note  that  columns  can  be  reordered  in 
order  for  a  developer  to  have  their  preferred  information  presented  first. 

The  tabular  view  is  helpful  in  providing  a  cursory  glance  at  the  activities 
in  other  workspaces.  It  is  more  detailed  than  the  ticket  tape  visualization,  and 
focuses  more  on  providing  a  cumulative  view  rather  information  regarding 
individual  events.  As  in  the  ticker  tape  visualization,  if  a  developer  feels  the 
need  for  further  investigation  of  a  particular  troublesome  situation  they  can 
revert  to  the  fully  graphical  visualization. 

We  are  currently  examining  several  enhancements  to  this  visualization. 
In  particular,  we  would  like  to  add  highlighting  to  draw  attention  to  particu¬ 
lar  conditions.  For  instance,  based  on  a  user-defined  criterion  of  highlighting 
all  artifacts  with  a  severity  greater  than  fifty  percent,  the  visualization  would 
underline  each  artifact  that  matches  that  condition  and  place  a  red  border 
around  its  severity  field. 

3.3  Explorer  visualization 

The  third  visualization  is  based  on  the  tabular  visualization,  but  instead  of 
using  numerical  tallies  it  introduces  graphical  elements  for  highlighting  the 
presence  of  potential  conflicts.  Shown  in  Figure  0,  the  left  hand  side  once 
again  is  an  expandable  tree  view.  In  the  explorer  view,  however,  the  tree 
view  is  enhanced  with  vertical  bars  indicating  the  severity  of  ongoing  and 


Figure  4.  Explorer  Visualization. 
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committed  changes:  the  longer  the  bar,  the  higher  the  severity  of  the  change. 
Although  not  visible  in  this  black  and  white  view,  changes  are  color  coded  to 
distinguish  changes  in  a  local  workspace  from  changes  in  other  workspaces. 

Clicking  on  the  name  of  an  artifact  presents  the  history  of  that  artifact  in 
each  of  the  workspaces.  For  instance,  “undo.c”  has  moved  through  three  ver¬ 
sions  in  Pete’s  workspace,  one  in  Ellen’s,  three  in  Steve’s,  and  two  in 
Mike’s.  Note  that  the  status  of  each  version  is  listed  (changes  are  in  progress 
or  changes  have  been  committed)  along  with  a  severity  and  change  impact 
measure.  This  helps  a  developer  in  gauging  whether  they  should  contact  the 
other  developer  to  enter  a  discussion  in  order  to  avoid  future  conflicts.  Click¬ 
ing  on  a  single  version  of  an  artifact  brings  up  detailed  metadata  regarding  a 
change. 


3.4  Fully  graphical  visualization 

The  last  visualization  is  a  fully  graphical  visualization  that  presents  a  de¬ 
veloper  with  a  hierarchical  view  of  an  artifact  and  its  constituents  (in  this 
case  version  1.1  of  the  folder  “home/word”).  Each  constituent  artifact  may 
itself  contain  other  artifacts  and  each  artifact  in  the  view  may  exhibit  multi¬ 
ple  versions  (as  indicated  by  stacks  of  artifacts).  The  visualization  acts  like  a 
web  browser  and  lets  a  user  zoom  in  or  zoom  out  of  the  hierarchy  by  double 


Figure  5.  Fully  Graphical  Visualization 
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clicking  artifacts  and  pressing  a  back  button,  respectively.  A  user,  thus,  can 
monitor  the  state  of  other  workspaces  at  a  high  level  and  zoom  in  to  explore 
potential  problems  that  may  be  present. 

Color-coding  separates  different  workspaces.  For  instance,  the  stack  for 
the  artifact  “/home/word/spell”  indicates  that  Ellen,  Pete,  and  Mike  each 
have  a  version  of  the  artifact  in  their  workspace.  Pete  and  Mike  each  have 
version  1.0  in  their  workspace,  and  their  changes  are  still  in  progress  as  indi¬ 
cated  by  the  question  mark.  Ellen,  on  the  other  hand,  already  has  checked  in 
a  new  version  of  the  artifact  (as  indicated  by  the  exclamation  mark),  result¬ 
ing  in  her  having  version  1.1  in  her  workspace. 

Artifacts  are  sorted  per  their  severity,  making  it  easy  to  monitor  one  part 
of  the  visualization  for  the  largest  conflicts.  Severity  is  indicated  by  the  pro¬ 
gress  bar:  the  fuller  the  bar,  the  higher  the  severity.  Graphical  icons  (not 
shown  in  the  figure)  help  in  highlighting  several  other  types  of  workspace 
changes  besides  a  user  modifying  an  artifact.  In  particular,  the  icons  identify 
new  artifacts,  artifacts  that  have  been  deleted,  and  artifacts  that  have  moved 
from  one  location  in  a  workspace  to  another.  These  kinds  of  changes  usually 
have  quite  a  bit  of  impact  on  the  overall  project,  hence  our  special  treatment. 

An  important  aspect  of  our  visualization  is  that  it  shows  pair-wise  con¬ 
flicts.  In  the  view  for  the  local  user  it  shows  all  conflicts,  but  in  a  view  for  a 
remote  user,  it  shows  only  the  conflicts  between  that  remote  user  and  the 
local  user.  This  makes  it  much  easier  to  locate  and  understand  the  impact  of 
potential  conflicts. 


4.  CONCLUSIONS 

One  of  the  main  functionalities  of  a  configuration  management  system  is 
to  shield  developers  from  the  effects  of  other  developers’  changes.  This,  un¬ 
fortunately,  limits  the  insight  that  a  developer  has  into  the  overall  ongoing 
state  of  a  project.  To  overcome  this  situation,  Palantir  continuously  shares 
information  regarding  the  activities  taking  place  in  local  and  remote  work¬ 
spaces.  By  letting  developers  choose  from  a  variety  of  visualizations  that 
inform  them  of  who  is  modifying  which  artifacts  in  parallel,  Palantir  com¬ 
plements  current  configuration  management  system  functionality  with  sup¬ 
port  for  the  human  identification  and  intervention  of  potential  conflicts. 

Our  next  steps  are  to  empirically  validate  Palantir  and  its  visualizations. 
In  particular,  we  would  like  to  determine  the  tradeoffs  among  the  various 
visualizations  and  understand  where  they  may  be  improved  and  what  addi¬ 
tional  information  may  need  to  be  present  for  them  to  be  truly  effective.  One 
particular  dimension  we  are  currently  exploring  is  to  add  a  measure  of 
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change  impact  (determining  the  impact  of  a  change  on  a  workspace)  to  our 
present  measure  of  severity  (which  determines  the  size  of  a  change  only). 
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